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FOREWORD 



This paper examines ihe problem of measuring and evaluating teacher performance. Evaluation methods currently in 
use are reviewed and a recommendation is made for the implementation of an approach to evaluation of teacher 
effectiveness. The authors. J. Marvin Cook and Richard F. Neville, denne the concept of teacher effectiveness as the study 
of teaching outcomes. In that context they analyze the relative merits of measurement based on student performance 
(direct measurement) and measurement based on teaching activities (indirect measurement) as they relate to the evaluation 
of faculty. J. Marvin Cook is an associate professor and Richard F. Neville is division chairman and professor at the 
University of Maryland, Baltimore County, Division of Education. This paper is based on a study performed by Dr. Cook 
tor the faculty senate at the Baltimore County campus. 

This paper is the thirteenth in a series of reports on various aspects of higher education. These reports are based on 
recent significant documents found both in and outside the ERIC collection. Additionally, current research on higher 
education is abstracted^ and indexed for publication in the U.S. Office of Education’s monthly volume. Research m 
Educauan. If you wish to order ERIC documents cited in the bibUography, send your request to ERIC Document 
Reproduction Service, Post Office Box Drawer 0, Bethesda, Maryland 20014. The ERIC document (ED) number must be 
given when ordering. Payment for microfiche (MF) or xerox/hard .copy (HC) must accompany orders ot less than $10.00. 
Docui.ients are $0.65 for microfiche and $3.29 per hundred pages for hard copy. AH orders must be in writing. 
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I. INTRODUCTION 



Now, more than ever before, the public at large and the 
young in particular pose peretrating questions about the 
quality of teaching in higher education. Russell Cooper 
(1970) notes “teachers are called upon to be accountable 
for the effectiveness of their pe formance as teachers,'’ to 
defend their instructional objectives, and to develop new 
styles and methods suitable to leach large. nunibers of 
students of disparate backgrounds and expectations. In 
this context, this paper focuses on the evaluation of the 



university professor in his role as teacher: other , areas ot 
faculty performance - research , publications , and 
service— are not emphasized. The paper is comprised of 
five parts, including part L the introduction. Part II assays 
the problem of measuring teacher performance; part III 
gives a summary of evaluation methods in use at colleges 
and universities across the country; pari IV recommends ^ 
specific models to evaluate teacher effectiveness; and part ^ 
V contains a selective bibliography. 




II. the problem of measuring 

TEACHER PERFORMANCE 



Teacher performance evaluation has motivated literally 
thousands of studies. Gage (1960) found that literature 
about teacher effectiveness was so prolific that bibllogFa* 
phies on thi subject bordered on the unmanageable. 
Another researcher, Tyler that: 

In the American college -immunity the evaluation of 
teaching is as common as the judgmerfl oi' the quality of 
dormitory food- and often as objective. Trolessor Smith 
is a wonderful teacher!* *We are^a select college, proud ot 
our tradition of exceilence in teaching.* Comments like 
these are part of the typical pattern of college conversa- 
tions. Yet wc know they are not highly valid, objective and 
impartial appraisals. Sound and systematic evaluation of 
college teaching is exceedingly rare and yet it is hi^ly 
essential to the Improvement of college Instruetioni 

Identifying where the teacher is successful and where 
he has .found difficulties offers a starting point for 
directing efforts toward Lfnprovernent, 'Hie Committee on 
> College Teaching of the American Council on Education 
(1961) surveyed seven types of hi^er education institu- 
tions to examine, their policies ^nd practices in faculty 
evaluation. The Committee fourid “without exception, air 
seven types of institutions said that classroom teaching 
was the most important factor in evaluation” of faculty 
members. Faculty members frequently comment on the 
fact that while the university “states” a commitment to 
effective teaching, what really counts is research produc- 
tLvity and. publication , It would appear that without a 
sound basis for studying teacher effectiveness, professors 
‘ will direct their efforts toward those activities that they 
believe will he rewarded by their institution, 

it is generally conceded that to become an effective 
. , » -' teac something more must be considered than the 
results of a trid-and-error approach or fortuitous 
O iccesses-hypotheses about effective or ineffective teach- 




ing can become principles only when the reasons for 
effectiveness are carefully examined. There are at least 
three salient reasons for establishing teacher effectiveness 
guidelines: 

• Every teacher would benefit froin a systematic 
appraisal of his or her efforts, 

^ • Teaching-along with research and service-is one of 
the three basic activities of a college or university 
professor and must be effectively performed. 

• Basic criteria for teacher performance need to be 
developed to ensure that faculty are properly 
evaluated. 

Beveloping a methodology 

If a list of consistent predictors of teacher, effectiveness 
were developed, qualified observers could enter a class- 
room , note a teacher’s perTormance, and measure the way 
he prortiojes learning. From the data obtained, it would 
be possible to predict what traits or skills a teacher needed 
to assure jthat his students learned what he wished to 
teach them; in other 'words, ’it would then be possible to 
predict the teacher’s degfee of effeciivemss. However, in 
spite of prodi^ous research in the area, Biddle (1964) 
reports that “few if any teacher traits, ikUls, or method- 
ology h^ve been identified by research as good predictors 
of teacher effectiveness ” 

To establish specific traits, skills, of methodology as 
predictors of teacher effectiveness in classroom situations, 
an investigator needs to gather data by direct observation, 
and to record teaching performance by categories of 
teacher behavior. The resulting profile could then be" 
related to student performance in specific areas of 
instruction. Unfortunately, as Biddle (1964) emphasized, 
observation of teacher behavior is a costly and tedious 
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process, wiih many variables affecting both student and 
teadier. One such varial is that a teacher's classroom 
performance is separated froni student achievement by 
whatever^jspan of time it takes to measure that achieve- 
ment. a’ further complication is that most persons doing 
research on teacher effectiveness have not dealt with 
behavioral observation. The lack of data in iliis area is 
mentiofted by Rosenshine (1970): . .arriong hundreds 

of research and evaluation reports at the ERIC Clearmg- 
house on Early Childhood Education, [one researcher] 

. . . found only ten observational studies reported since 
I960”; furthermore, Rosenshine notes that of 46 science 
projects on the higher educational level, only one included 
an observational analysis. In the same study, Rosenshine 
also points to the difficulty of meaningfully correlatinr 
data between teacher performance in the classroom an J 
student achievement: 

Compared to tlie large number of descriptive studies, there 
have been relatively fesv studies of the relationship between 
measures obtained by the use of observational systems and 
measure of class achievement adjusted for initial aptitude or 
abiFity, Approximately forty studies were completed in this 
area. These studies are difficult to synthesiz.e for at least 
three reasons: they varied widely in subject area, grade, and 
observational instruments; som,c studies used statistical 
procedures which were not appropriate; and in many 
studies the number of classrooms observed was less than 
twenty. Two attempts have been made to review some of 
the research relating classroom behavior to student achieve- 
ment. . ,but because of the above difficulties it is too early 
to identify relationships that can be stated with any 
confidence. 



Defining relationsriips—a first step 

There are at least two reasons for the difficulty of 
measuring teacher effectiveness; first, the inconsistent 
definition of terms; .and second, an inability to relate 
definitions, once agreed upon, to a methodology for 
evaluating teachers. In the past, the phrase “teacher 
effecti\^ness” has too often been used as a synonym for: 
traits of teachers; classroom skills exhibited by teachers; 
student ratinp received by teachers; classroom interaction 
or participation piOduced by teachers; classroom climate 
established by teachers; as- well as student achievement 
effectuated by teachers. The same factors have been called 
“competence,” “criteria for competence,” “ability to 
teach,” “value of a teacher,” and others. In this paper, a 
distinction , is made between teaching traits and skills, 
“effectiveness,” “rating,” and “evaluation.” The following 
definitions incorporaff these discriminations: 

• Teaching functions refer to the categories of the 
role of the teacher as a facilitator of learning. These 
categories are established on the basis of% die 
different ways a teacher relates to the studeht^s 
learning proce^. For example. Tliere might be three 
teaching functions ‘ which the teacher fulfills as a 



fucilitutur uf Icurning; diugnostician. prescijbci . and 
enabler, 

• Mea^iremenr of teaching train and skills (including 
methodology and academic competence) is the act 
of dcterniining what and how dirfercnl trails and 
skills f methodology) are used by a teacher for the, 
different teaching functions ■ 

• Teaching effectiveness is the ability of a teacher to 
facilitate the acquisition by his students of those 

■ ways of thinking, feeling, and acting that have been 
previously defined as his instructional objectives. 

• Mcasurcvicnt of teaching cffcciircncss is the act oi 
measuring the extent to wltich students Imve learned 
what the teacher is trying to teach. 

• Inference refers to the process intervcfiing between . 
the objective data seen or heard by an observer and 
the coding of those data on a classroom observa- 
tional iiistruinent. 

• Raiing a teacher is n procedure by wliicli ihe raler 
observes the teacher's per forma nee ir] a classroom 
and then infers from liis perception a series of 
such performances whether or not (lie teacher is 
“enthusiastic,” “conununicates clearly,” '1s rap- 
port with students.” etc. Such ratings yield general 
impressions which luck specificity. 

« Delineating a set of values involves the act of identi- 
fying the set of justifiable values that is to be super- 
imposed upon a set of collected data. 

• Dvahm. mg a teacher is the act of weighting the 
measured data obtained about a teacher by superim- 
posing a set ofjalues onto the data. 

These definitions, by distinguishing between the 
various acts .in -the teacher evaluation process, should . 
remove a priniaiy roadblock to solving the problem of 
teacher evaluation. Figure I displays these definitions 
hierarchically to clarify their functional interrelationships. 
Referring to the illustration, teacher evaluation can be 
seen as consisting of two majgr components: a data source 
of teacher effectiveness, and tliQ set of values us^ to 
judge the data. When the measurement data has been 
obtained on a teacher's effectiveness, and a seb of values 
has been decided upon by a promotion or tenure commit- 
tee, the process of evaluation consists of weighting the 
measured data by superimposing a set of values predeter- 
mined by the committee onto the data known about a ' 

' teacher. 

For example, suppose the data collected showed that a 
teacher’s students learned what he intended them to learn; 
and suppose what he intended thith to learn wera,Hsted as 
instructional objectives A, B, C and D. If the set or values 
of the promotion and tenure committee were such that 
objectives A, B and C were not acceptable instructional 
objectives, then the superimposition of these values onto 
the data collected about the teacher would give the 
teacher credit only for' having tau^t objective D. Of 
coime, the outcome in this example coidd have been 
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Figure 1 . The relationship of the different components in the act of 
evaluating professors in the teaching role at a unh- lity. 






prevented by making clear to the teacher what instruc- 
tional objectives were to be included in the evaluation. 

As shown in Figure 1, the. data source /of teacher 
effectiveness can be divided into two distinct types of 
measufements: direct and indirect. A d^ect measurement 
quantifies the effects of teaching and can be defined as 
the extent to Which students have learned what the 
teacher is trying to teach as deterniined by the teacher ^ his 
department and other qualified judges. An indirect 
measureffwnt h' ^ measure of what the teacher does* to 
facilitate learning; for example, the teacher fleets instruc- 
tional objectives, selects course materials, establishes a 
learning ‘ environment, prepares academically, organizes 
and presents materials, dia^ose^ students, interacts with 
students, etc. . . ■ = 



Indirect measurement 



Based on Figure 1 , both student rating of teachers and 
the measurement of teaching traits and skills are seen to, 
be indirect measures of teacher effeciiv^ne^. To use such 



data a correlation must be established between what a 
teacher does to facilitate learning and what students do as 
Y^isult. of a teacher’s activities. Correlating the activities 
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of a teacher as a facilitator of student aclilevement is no 
easy task. It must be done if indirect measures are to be 
used to measure teaching effectiveness. 

The indirect m^ure in Figure 1 is divided into high- 
and low-inference measures. The following comments by 
Rosenshine (1970) explain these two elements and high- 
light their advantages and disadvantages:, 

Instfuinints for the ob^rvation of instniction are currently 
divided into category systems and rating systems. This 
division is based on the amount of inference requi^d of the 
observer or of the person reading the reseejch report. 
Inference here refers to the proee^ intervening between the 
objective data seen or heard and to the coding of those data 
on an obsen^ational instrument. Category systems are 
classified as iow> 'mference measures because the items focus 
upon specific, denotable, relatively objective behaviors such 
as “teacher repetition of student ideas,” or “teacher asks 
" evaluative qucstion[,)” and because Uitie events are re-, 
corded as frequency counts. Rating systems are classified as 
high‘inf$rence measures becau^ they lack such specificity. 
Items on rating instruments such as “clarity of piesenta- 
tion “enthusiasm,” or “helpful toward students” require 
that an observer infer these constructs from a series of 
events. In addition, an observer must infer the, frequency of 
such behavior, in order to record whither jt occurred 
“consistently,” “sometimes,” or *^ever ” or whatever set 
of "^^aduations luc used in the scale of an observation 
instrument. To a reader, the statement that a teaser 
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repeated HiudcnLs answers 1^^ oI ihc time is much more 
speciriw than the statement that a teacher wa?, sornctimes 
hdphil toward students. 

Category systems have become very popular in descriptive 
educationai research and in teacher training because they 
oner greater low-inference spccttlcity and because an 
“objective’' of a teacher's encouraging statements to 
Students appears easier for a teacher to accept than a 
’’subjective” ratttig of his warmth. The major disadvantages 
of category systems are the cost of using observers and the 
difficulty of specifying behaviors before they can be 
included in a category system. 

Rating systeins offer greater flexibility than category 
systems because they can include high-inference variables, 
Rating Sj HtcrriS Can also be cxpcriSivc if the Students in 
the classroorris are used as observers. , , ,The disadvantages 
of using rating sy stems are summarised by Mouly (1969): 
they include the halo effect, the error of central tendency, 
generosity or leniency error, and the lack of a common 
referent for scoring calibrations such as “excellent” or 
“seldom.” Another disadvantoge, noted by Gage (1969), is 
that higit“ii!ierence items are diftlcult to trunsiate inti> 
specifle behaviors. This suggests that evaluative reports 
based on highdnfcrcncc measures may offer few specific 
suggestions for improving an instructional program. An 
evaluative report which suggests that teachers need to 
improve their clarity and organiiation, without giving the 
low-inference correlates of such behaviors, may amount to 
little more than .suggcsling that the teachers be ‘good and 
virtuous.’ 



It is a mistake to perceive specific acts of teaching and 
learning as being in a simple cause-and-effect relationship. 
Teaching and learning are a; process comprised of many 
interactions betw en teacher and student. The teacher 
receives responses from the student and reacts to these 
responses in ways intended to induce learning. Concur- 



rently, the student receives responses from and reacts to 
the teacher. This cycle of giving and receiving instruction 
is represented by Smith (19SQ as follows; 



\ 




Where- 

Pj ^ the teacher's perGeption of pupil behavior; 

= the teacher’s diagnosis of the pupil's state of 
^ interest, readiness, knowledge; etc., made by 
inference from pupil behavior; 

Rj = the action taken by the teacher in li^t of the 
diagnosis* 

^ - the pupil’s perception of the teacher’s behavior; 
the pupil’s diagnosis of the teacher's interest, 
.. etc., as inferred from die teacher’s behavior; 

Rp» the reaction of the pupil to the action of the 
teacher. 



process suggests why it is difficult to obtain a positive 
correlation betW'een specific traits or skills of teachers and 
specific learning by students. Other factors that incrciise 
tliis difficulty include the cumulative nature of learning 
described by Gagne fl965) arid, as lias been noted, inc 
fact that a lime lag often exists between the act of 
teaching and the measure of student achievement. 

Various preconceptions among students about the 
teaching-learning situation must be taken into account as 
well. Each student brings to his environnieni a unique set 
of criteria, such as personal needs and beliefs, Jhat act as 
intervening variables and add to the difficulty of identi- 
fying and using specific traits and skills as predictors of 
teacher effectiveness. Young fl970) observes that since 
each classroom situation is unique in itself, a unique set of 
conditions is necessary before learning can occur; in other 
w'ords, the skills a teacher employs in one situation may 
not be suitable for other or even similar situations, Biddle 
(1964) comments on this problem: 

Some teachers may be inspirational leaders, others warm 
counselors, and still others walking encydopedias. In 
certain contexts, each of these competencies [skills) may 
be highly effective, in others each might have little or 
negative effect, 

For example, consider the teacher trait of warmth. It 
often has been assumed That “warm” teachers have a 
positive effect upon students, and that “cold” or “indlfi 
ferent” teachers have a negative effect. But is this 
universally true? “Warmth” may have a positive effect 
uponi first graders, but how about in terms of university 
students? Does “warmth” have the same value in 
respect to students with disparate backgrounds and 
expectations? Is “warmth” equally effective in the class- 
room, the laboratory, the office, etc.? Is “warmth” 
suitable to certain subject orientations— social science in 
contrast to a course in ^ology? 

Because the problem'^ of correlation is complex, the use 
of indirect measurement data obtained on a teacher’s 
performance should be used only for descriptive 
purposes=and not as a measure of lus effef'tiveness. Until 
positive correlations can be made between specific teacher 
activities and student achievement, ratings based upon 
observation by students, colleagues, or trained observers 
should not be considered adequate measures of teaching 
effectiveness. 

Direct measure 



Not&i Double vertical lines mark off instances of the 
^ complete cyEla; single vertical lines divide the cycle 

into acts of teaching |p^“. .. rJ or acts of taking 
instruction jPp . . . Rpj ; - 

s ’ 

As illustrated above, learning new ways of thinking, 
feeling, and acting results from the student’s participation 
in an exchange with the teaqher. The tenuoushess of this 

O 



The complexity of the task of evaluating teachers can 
be substantially simplified when a direct measure of 
ceaeher effectiveness is used. Wide many factois impinge 
on the beha^or of the student, the teacher doekhava an 
extended opportunity to effect a change in ^©-^tudent:,’ 
competence levels, and this change in competence 
becomes a relatively accurate barometer of a teacher’s 
effectiveness. If a value is placed upon students acquiring 



the-^instructional goals. Uicn ihe task ol evaluating a 
icachoi, as noted by Tyler (l^bO). is that ul dcternuning 
•'the extent to which the purposes- oi the Iteachinej 

activity arc actually realised. 

At the National Conferc-nce on Teacher bvaluation. 
Herbert Hite (19o8a) noted that the object ot teaching is 
to bring about learning, and stated: 

when Iteachersl teach papils. the pupils should demon- 
;,rate learning. . . But I don’t think all that's teamed ,s 
taught. InaMnuch as we tedcllt wc have purpose, and 1 think 
that without purpose you don t have ttae nng^ 

The object, then, is for thc^khers to define the evidence 
■ mat they'll accept as proof\t%t this learning has taken 
place; and then to arrange mH^tors so that tne mdiv.dua. 
learner does demonstrote this evidence, 

Rosemary Pierrel (n.d.) of Brown University supports this 
•attitude and expands upon it by suggesting that undeily- 
ing any measure 6f effective teaching must be specinca- 
tion of the desired end behaviors, both in the classroom 
and hopefully, well beyond the college years. These end 
behaviors should be defined in testable terms and would 
require observation of the students as well as the teacher. 
Ideally, Pierrel would require follow=up studies of alumni 
to dcterniine long-term retention of learned behaviors. 
Her position is encapsulated by this remark: “Not only is 
what the student }wars- more important that what the 
professor says, but it is what the student t/oes about it and 
continues to do about it, that demonstrates liis protessor s 
effectiveness."’- 

Biddle (1964) concurs with Hite and Pierrel: teacher 

effectiveness [is] the ability of a teacher to pro4uce 
agreed-upon educational eftects in a given situaUoii oi 
' context,’’ These ‘educational effects are those desired by 
tlie teacher and may be exhibited by the' students during 
tile teaching process as well as at the end. This direct 
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approach to measuring teacher 
expressed by Tyler (IboO): 

What o involved in evalii:Uin!! icachitiy,’ In essence, the task 
is similar to that of evaluating any other purposelul .wtiv. 

Uy I, ,‘s one of fmdinii out the extent to which the pui- 
-oses of tlie aetivitv arc .actually reali/ed. This seems simple 
, noush M but as we examine the, purposes ufteaehmg. we 
fu.d certain complications. The punmses ot tcaehi/igate to 
fadlitate v.-tinu,- kinds of desirable karnme on tne put ol 
the students. Hence, the evaluaium o) leaehnt! tn.-u/.c, 
appraising h'C learning a) swJcm Hut learning is the 
- t^quisition b the student of ways ot behaving, th.,t is. 
ways of thinktng. feeling and acting, which he has not pre- 
viously followed. Thus , a eollege student may develop an 
understanding of the physical structure of the atom, an 
understanding which he did not possess before, or he may 
acquire the abUity to analyze a proDiem m pia... v.o. 
Which he could not do betote, or he may develop greater 
skill in reading literary works ’than he had before, or he mav 
acquire intellectual or aesthetic interests which he did no 
have previously. These are a few random 
kinds of lc.aming which college teachers may seek to bnng 
about in their students. 

Of course not aU things students learned in college are 
desirable, so that the evaluation of college teaching is rao 
simply finding out whether students have learned, but 
whether thev hare learned the things winch the instnictors 
were Irving /<? teach. Students may learn bad habits as well 
as good ones, they may acquire misintormation as well as 
sound understanding, they may acquire a d^astc as well as 
an interest in v;mous intcllecfual and aesthetic fields and 
these, too. are cases of learning, but not cases of the learn- 
ing which coUcic teachers uc trying to bring about. 
Furthermore, since the coLlege years ate relatively short lar 
. too short to learn all that we wish could be ,carned- 
teachers must make a fairly rigorous selection of those 
kinds of behavior.'that is, of those ways of thinking, feeling 
and acUng. that they ■consider most important for students 
to acquire. This, then, makes the evaluation of teaching 3 
matter of finding-out how far the students are acquiring the 
important ways of thinking, feeling and acting c,„ which the 
teachers ire focusing their attention. 
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In the previous section, two basic methods to obtain 
data for the evaluation of university teachers are noted. 

> One method is a general assessment of the teacher s abLi y 
to motivate learning-the indirect method- This method 
would rate teachers by colleagues’ and studpts judgmen 
of the teacher’s performance and by a trained observer 
using a low-inference category iijstrument. This indirect 
evaluation method has validity as a measure of v*at a 
teacher does, but is not valid as a measure of the results of 

\ ' -a. ^ 

teaching. * 1 ^ 

The second method - direcf method - seeks to mea- 
sure a teacher’s, effectiveness by whether students have 

learned ways of thinking, feeling, and acting that the 
instructor was trying to teach. In contrast to theTre- 
quency' of the student-colleague rating procedure used on 
other campusis, This method is not known to be used at 

this time at the college or university level. 



The indirect method-evaluating teaching ability on the 
basis of a general assessment by student cr colleague 
' rating procedures-is used most frequently on campuses 
across the couhtry. Most often, the rating is done by stu- 
dents based upon their perception of the teacher. There 
appears to be considerable agreement among students, 
faculty, and administrators at the point of rating teachers. 
If students rate a 'teacher lugli or low on » 
teaching skill (e.g., clarity^ of presentation), it is higWy 
probable that the teacher’s colleagues as well as adminis- 
trators will rate him sirrularly, , e „ 

The use of student-devaluations of teachers perfor - 
ance has greatly increased during the past decade; hovv- 
■ ever the use of student evaluations is not new. Harvard s 
mnm\ Confidential Guide to courses was first published 
in 1924, and student evaluation began at the University o 
Washington in the same yeu.. T large coUection ofeurrent 
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qujsrionnaires is mainraincd by iriu National Siudcni 
Assuciali'on (NSA) at its Washington lieadquarters. An 
extensivp appendix of diHcfcnt kinds of questionnaires in 
use at a wide range of colleges and universities is included 
in Philip Werdeirs book for NSA, Courst- and Teacher 
Tvuluutiim (1067), The rtvailability ofsiich questiorujaires 
is a definite asset when a university considers establishing 
a student rating system for evaluating teachers. 

A report on the evaluation of teaching sponsored by 
the Academic Senate of the Davis Campus, University of 
California, yielded five factors that are common to most 
rating forms used by students and faculties. These factors, 
epitomizing desirable teaching traits or skills, are listed 
below: 

# Analytic ! Synthetic ^ j 4 pproflc/i— scholarship, with 
emphasis on breadth, analytic ability, and con- 
ceptual understanding. 

# Organization j Clarity at presentation, but is 
subject related, not student related, and is not 
merely rhetorical skilk 

# Instructor^Group Interaction— TdL^j^oxi with the class 
as a whole, sensitivity to. class response, and skill at 
securing active class participation. 

# instructor-individual Student Interaction ^mutual 

respect and rapport between the instructor and the 
individual student. ' 

# Dynamism/Enthusiasm— the • flare and infectious 
enthusiasrp that comes with confidence, excitement 
for the subject, and pleasure in teaching. 

Universities wishing to create a student rating form could 
well begin with this list of definitive categories. 

Jointly sponsored by the American Association of 
University Professors (AAUP) and the Association of 
American Colieges, who are supported by a grant from the 
Carnegie Corporation, a 2-year program entitled “Project 
to Improve College Teaching” began its formal existence 
September 1, 1969, from offices at the AAUP national 
headquarters, Washington, D.C. The director of the 
project, Dr. Kenneth E. Eble, traveled to a number of 
colleges and universities to learn first-hand about promis- 
ing programs that attempt to measure competency in 
teaching functions. The following account of approaches 
on various campuses is from that report (Eble (1970)): 

Princeton 

In the first stage of a niot Study in evaluation at Princeton, 
four question nairas were given to a sampling of courses and 
^ studenta in Biology, PoUtici, Electric^ Engineering, and 
' French. The questionnaires consisted of a course evalua- 
tion, a departmental evaluation, an evaluation of the total 
underpaduate expeilence, and an evaluation of tile' first 
two years. • ’ ‘ 

The results from tlte four departments were so proniising 
that in the sacond stage of the Pilot Study the question’' 
nakes ware pven to aU' undergraduates in all courses in the 
university. Some of the conclusions resulting from this 
experience are worth quoting: 

o 



\. Oul expeficiKC ha^ SC i dl Tcsi OUT niisslvings Jbout 
student interest in such questionnaires and about the 
seriousness with which they would be treated. 

“2. Though wc iuive taken no poll, our experience with 
the comments and iu tiers of the faculty show that 
those who have reported to us have found the 
repofl.s of thclf couHiCS helpful. 

... we are convinced tliat the administration of ques- 
tionnaires of the kind we have tried are important in 
giving students a sense of greater participation in 
their education.’ j 

On the basis of this extensive study, Princeton has incorpo- 
rated the four questionnaires into the regular educational 
process. The course evaluation the present time is a mod- 
ified open system with tlie machine print-outs for individ- 
ual courses available to the instructor and his department 
chairman, and to a central fUe available to students and 
otlicr members of the university. 

Thu University of Washington 

Students evaluation of teaching has been going on at the 
University of Washington since 1924, An office of student 
ratings is supported by the university and directed by a 
faculty member expert in the area of testing and counsel- 
ing. The office not only responds to faculty requests for 
evaluation but cohducis continuifig research on the evalua- 
tion process, 

■ f . ^ 

I’’ acuity participation in the program has been voluntary; 
results were made avahablc only to the individual teacher, 
but he might have them forwarilgd to his department chair- 
man Of dean if he chose. The* ten items on the current 
questionnaire were factored out of forty^one items most 
frequently mentioned In Uie extensive literature In the 
field. 

This year, the Faculty Senate recommended the adoption 
of additional procedures that will insure that every under- 
graduate class is evaluated at least once in threC’ years and 
perhaps oftener. These evaluations will be reported to 
department chairmen and deans and made available to 
departn^nt and college review committees. 

The University of Texas 

University of Texas' students publish one of the largest and 
most detailed of course evaluation booklets. The booklet is 
part of an extensive effort in the college of arts and sciences 
toward *a more .sy.stematic effort in teaching evaluation,’ It 
includes not only the statistical data from a questionnaire, 
but an extensive iampllng of iepresentative written com- 
ments from the students and a self-appraii^ of the course 
by the teacher, 

Dean John SUber is inviting each faculty member vdth 
tenure to participate in the evaluation at least oflce* every 
three years and those without tenure at least once a year. 
Teaching assis.tams and assodates w invited to participate 
at least once and p^slbly twice a year. Participants will 
have the option of releasing results to the dep^tment chdi- 
man, dean, or the student committee responsible for pub- 
lishing the tesults. 

University of Hartford^ University of Delaware 

Both of these universities have begun special efforts to 
improve teaching and to examine the contribution that 
evaluation, of teaching may make. The departments of 
Histor>', Politicil Sdenee, and Speech and Drama at the 
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University of Hartford are working with the project in 
, devising, administering, and evaluating Uie results of a stu- 
dent evaluation questionnaire. 

A University of Delaware faculty comniittce Mas finished an 
extensive examination of teaching and ways of improving 
it. The Senate has accepted the report, and the next step 
will be to get many of its excellent recommendations imple- 
mented. 

Hamline University 

ilarnline University reports a successful teaching Improve- 
ment pro|ram which involves annual evaluation. Associate 
Dean Kenneth Janzen attributes its success to 'a faulty 
which has Insisted on being involved in the process.of evalu- 
ation, has been willing to take tlie responsibility for devis- 
ing means for making tliat evaluation, and has been willing 
to assume the burden of making evaluations.’ 

All f ally members at Hamline are reviewed every year. As 
IKiii of that review, each faculty, member is required to 
submit student evaluations of three classes during the year 
or to be visited twice year by each of two visitors from an 
evaluation ^eam of eight experienced teachers. The faculty 
member, at his option, may also submit his own Self- 
cvaluation, and/or an evaluation of historic by a colleague. 
The most widely chosen option the student evaluation, 
thougli a number of faculty use all the evaJuaUon proce- 
dures. 

There have been recent efforts on a number of other 
campuses (e.g., the University of Toledo, New Mexico 
State University, Northern Illinois University, Western 
Kentucky University, and University of California at 
Davis) to identify characteristics of teaching that predict 
teaching effectiveness. In each of these efforts teaching 
effectivehess was not measured in terms'of what students 
had learned. Each of the studies focused on measuring 
teacher behavior in terms of traits and skills as such- 
divorced from the effect of those , traits and skills and 
student achievement. 

With reference to teaching traUs and skills (not teacher 
effectiveness), the above research tended to provide sup- 
port for the following: 

• Faculty and studentr tend to a^ee in their rating of 
teachers in terms of specific teaching skills. 



• The rnef^allocation of time among academic pur- 
suits and the performance of professional activities 
associated with teaching does not assure that in- 
struction is effective, 

• Ratings of competency in teacher functions in the 
classroom by faculty colleagues should be used as a 
supplement, not as a substitute, for ratings by stu- 
dents. 

• Student ratings of teaching assembled by faculty 
should be made available to the students; otherwise 
students can be expected to collect and disseminate 
their own data. 

• Relatively short student rating forms (15 to 40 
ilems) are preferable to longer forms. 

• Student rating forms should be used to supplement, 
but not to substitute for, other kinds of evaluations. 

There are many arguments for and against student evalu- 
ation. It may be helpful for future reference to note that 
the evidence supports the assertion that students are 
competent to measure the level at which the teaching 
skills are performed in the classroom. Do student evalua- 
tions have a positive and continuing effect upon improv- 
ing teaching? The eviderree is not conclusive. Eble notes in 
his Special Report (1970) that initially student evalua- 
tions eall attention to teaching and aid in the dialogue 
that could lead to improved teaching. At the same time, 
evaluation rnay create defensive postures that hkrm good 
Teaching and may even work specific harm oh certain 
- individuals. However, Eble notes that in the evaluations he 
• observed", many of the harmful effects were anticipated 
and guarded against, . , 

Eble also offers some oplniohs on the reliability of 
student evaluations and the advisability of using them. He 
states that althou^ undergraduate student evaluations are 
limited in the information they can provide, undergradu- 
ates do have competence to judge in some important 
matters and are able to sKed some light oh such matters. 
Jn addition, while student evaluations are limited, they 
probably are less limited than other means such as class- 
room visitation by colleagues.* 



IV. A MODEL TO EVALUATE TEACHER EFFECTIVENESS 



The study of teacher effectiveness is a search for the 
relationship between teach ^^ehavior and the effects of 
this behavior on the knowledge, feeling, and performance 
of students* Biddle (1964) points out the relationship 
between* teacher traits and teacher behavior and states it is 
unlikely* that significant advances will be made about 
understanding teacher competence* without a clearer 
picture of teacher behavior and its effects. Teaching effec- 
tiveness moves beyond the description of a teacher’s traits 
or the methods (teaching decisions.^strategies, etc.) used 
Jj'' a teacher, to the examination of these factors in rela- 
1 to what the student learns; in other words, the study 



of teacher effectiveness considers the extent to which the. 
purposes of teaching are realized. Therefore, teaching 
effectiveness means appraising the learning of students. A 
college student may develop an understanding about the 
structure of an atom; he may learn mathematical opera- 
tions and proofs; or he rnay acquire new intellectual and 
aesthetic interests. These acquired understandings and 
interests are examples of co^itive. and affective learnings 
that students attain at a university. Obviously, there are 
many other things learned that are beyond the presefibed 
curriculum. The university environment stimulates many 
opportunities for the informal cultivation of tastes. 
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knowledge and social roles. The question of teacher effec- 
tiveness, however, rests in the analysis of the progress 
students make in the acquisition of knowledge, and ways 
of thinking and acting that their instructors were trying tu 

teach, ^ ^ ^ 

In summary, anyone who sets out u; stu»ay tcacner 

effectiveness and to construct an instrument for the 
purpose of collecting data on teacher effectiveness must 
recognize that measures of effectiveness are directly re- 
lated to (he performance of students. Therefore, proce- 
dures established to guide the collection of information 
on teacher effectiveness would appropriately include ways 
of determining the results of a teacher’s efforts as ex- 
pressed in some form through the demonstrated knowl- 
edge and abilities of his or her students, 

Tyler (1960) has identified the generic tasks that are 
associated with obtaining a direct measure of teacher 
effectiveness. These tasks can be simply stated; however, a 
commitment to the tasks is corisiderably more involved. 
There are basically two generic tasks involved in obtaining 
a direct measure of teacher effectiveness: the first task is 
the clarification of the instructional objectives; and the 
second task is the measurement of the extent to which the 
instructional objectives/ have been achieved by the stu- 
dents. These tasks can and must be performed it the re- 
sults of the efforts of teachers are to be measured. 

Defining Instructional objectives 

The first difficulty faced by a university teacher, or a 
university faculty, is stating instructional objectives 
clearly enough in terms of behavior that the students are 
■ learning so that evaluators know what to assess when judg- 
ing the degree to which students have acquired ■ these 
behaviors. It is no secret that many statements appearing 
in college and university syllabi, course, outlines, or cata- 
logues are listings of content areas t^at the teacher will 
cover, not statements of the things students are expected 
to learn. Often students in different sections of the same 
course learn quite different things, even though the 
teacherS of the different sections use the same list of 
course topics. One teacher may have the students memo- 
rize certain dates, formulae, facts, names, and definitions; 
another might want students to discriminate among con- 
cepts in analyzing situations and Issues under^these topics; 
a third might want students to select an,d apply the neces- 
sary principles to solve problems; and a fourth might want 
students to develop an attitude toward the subject that 
will be expressed in a continuing interest beyond the 
course. Simply listing topics does not indicate which of 
the above or other kinds of behavior are teaching goals for 
any particular course, 

Tyler (i960) suggests that each teacher needs to 
* answer such questions as: 

Wh^t things am I re^y trying to get students to under- 
st^d? What kinds of abilities and .skills in^ thinking, 
analysis, probiem-solving* and the like am I feaUy trying to 
deyelop? What kinds of competence in reading, or writin 




or mathematical operations .am I actually, seeking to help 
students to acquire? Am I trying to arouse certain intel- 
lectual and aesthetic interests, arid, if so, what are they? 

Am I trying to develop some study habits or practices that 
will aid the student in his continuing developnnent. and, if 
so, what ones? Am I seeking to instill certain attitudes, 
appropriate to understanding or appreciation of phenomena 
in this field, and, if so, what are they? 

Until teachers can state in clear terms the different types 
of student behavior they wisli students to acquire through., 
their teacliing, a sound evaluation of their teaching cannot 
be made because the evaluators will have no cormnun 
criteria for judgment. 

Evaluation requires not only a knowledge of stu^ 
dents’ behavior that teachers wish their students to 
acquire, but each kind of behavior must be clearly des- 
cribed so that it can be determined when the student has 
attained that objective. Teachers often say they are not 
trying merely to teach facts or simple skills, but are trying 
to develop “understanding” of the topic. However, many 
teachers are hard pressed when asked what they mean by 
“understanding” when they attempt to differentiate it 
from memorization of data. Other teachers have difficulty 
defining what is meant by such terms as “aware of, 
“appreciation for” etc. For example, some teai^hers 
believe that students have art “understanding” ot certain 
concepts and principles if they can describe them in their 
own words; or if they can distinguish among examples of 
them; or if they apply them in analyzing and solving prob- 
lems. Teachers might accept different student behavior for 
an expression of “understanding”; yet, if they are able to 
state their objectives as such, these statements can be used 
as a basis for evaluation, since it is then known what is 
being looked for. To determine how effective teaching has 
been in the development of “understanding,” each in- 
structional objective should be similarly stated. 

It is important to keep in perspective the role .instruc- 
tional objectives g^lay in the total educational experience 
of students. The objectives for a course must support the 
instructional objectives (goals) of the department. In simi^ 
lat manner, the department’s objectives must support the 
objectives of thp division, assure that the students in 
different sections of a course acquire the necessary com- 
petence to continue work toward achieving objectives - 
established by the department and reflected in other 
courses, it is necessary that the teachers of different sec- 
tions agree upon instructional objectives for the course— 
terms of observable student behavior . Only by such agree- 
ment wUl students in all sections have the prerequisite 
competencies to succeed in the next course that supports 
the objectives of that department. Thus, besides the need 
for course objectives to be stated in terms of obser^ble 
student behavior, If is importartt that the objectives be 
agreed upon by colleagues who are teaching different sec* 
tions of the same course. Obviously, if a departments 
objectives are stated iii terms of what the students will be 
able to do as a result o^ the department’s efforts, the task , 
of colleagues reaching a^eement on instructional objec- 
tives of a course is- made ^sier. 

.A 
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After the individual instructor has formulated agreed- 
upon course objectives, the second step is to select ways 
of determining whether or not students have attained the 
competencies described in the instructional objectives. 
The means for gathering evidence of student achievement 
in large part will be determined by the wbjectives of the 
course; that is, the desired behavior of the students that 
has been described by the teachers as reflecting “undcr- 
sianding/’ Althougli evidence of student understanding 
in some disciplines can be obtained by means of paper 
and pencil tests, these means will not be satisfactory in 
other disciplines. However, in ail disciplines it is highly 
important that the tests (term papers, recitals, lab reports, 
etc.) actually require the students tu exhibit the type of 
competence that is described in the instructional objec^ 
lives. If “understanding” in the instructional objectives is 
defined as the student being able to select and apply the 
necessary concepts and principles for solving problems, 
then the tests would not have performuice agreement 
with the fhktructional objectives if they require the stu- 
dent to be able to select and name the necessary concepts 
and principles, 

Tyler (I960) speaks to this point in the evaluation of 
teacher effectiveness: 



The evaluation of the results of teaching requires more than 
one appraisal of the students' behavior. Learning is the 
acquisition of new ways of behaving. It involves changes in 
the students' reactions. Hence, we shall need at least two 
appraisals', one toward the beginning of a course or the 
college experience and another toward the end in order to 
see what changes have taken place and thus to estitnate the 
effectiveness of the teaching in stimulating and developing 
learning. Furthermore, many of us are interested in the 
pernnanency of learning ajpd>^e shall, therefore, want to 
make a further appraisal the students' behavior some 
time after the course is completed, or after they have gradu- 
ated from college, to find out the extent to which tiie 
learning- which was noted while the students were taking 
the course or were in college is stUl evident some time later 
or whether a considerable amount has gone. 



Options for teacher evaluation 



These are at least three options for iniiial action in 
proceeding to determine instrumentation and procedures 
to guide the study of teacher effectiveness at the univer- 
sity or college: 



1. Direct Measurement of Teacher Effectiveness. This 
option requires that attention be given to the collection of 
•data about class performance level. Parformance measures 
such as formal testSi interpretive papers, creative efforts, 
etc., would be used as. criteria. The review of these efforts 
would be made by the teacher’s coUeagues in accordance 
with procedures established by the department. Clearly, 
student efforts must be considered in terms of classroom 
goals, and the relative sophistication of the students with 
whom the instructor^ has been working should also be 
taken into account. * 






A plan that immediately moves to a direct measure 
teaching effectiveness, however, may not be practical. 



Before it can be applied, the faculty of each department 
must do a great deal of study and decisionmaking about 
the program. Agreement on total program and the objec- 
tives of courses within the programs are needed; also, the 
means by which initial information on student perform- 
ance is obtained must be designed, along with the means 
for determining the competence of students at the end of 
the course. The extensiveness of the direct-measure 
method make^ it prudent to suggest that this approach be 
seen as a long-range goal to be used initially in conjuctioii 
with other options. 

2 . The Construction of a Student Rating insirument. 
The second option involves the use of a student rating 
form as the primary source of data. This option has most 
frequently been selected by universities in the past se .cral 
years. Student rating forms and instrumentation, as des- 
cribed earlier, are used as indirect measures of teaching 
effectiveness. Such ' instruments are typically used to 
record student perceptions on a host of teacher traits, 
teaching methods, and class organization, as well as taking 
inventory of the attitude of feelinp of the student(s) 
about the course. It has been stated that these concerns 
are not of themselves direct measures of effectiveness, but 
may be used as hi^-inference indirect measures. The 
research on the use of indirect measures as predictors of 
teacher effectiveness, is, at best, inconclusive. However, in 
cases where instruments were constructed in accordance 
with the purposes of the evaluation process, and where 
there was participation by both instructors and student 
representatives in this effort, the use of student rating 
forms turned out to be a constrbetive factor in teacher 
evaluation. Furthermore, instrut^tors gain the benefit of 
student thinking, and students have been described as 
taking the process seriously. Beyond an inveuiory . of 
student perceptions about individual instructors or 
courses, student ratinp may offer a faculty important 
information about its progress<:)in articulating Its own 
predetermined program goals. 

The analysis performed at the University of Cali- 
fornia, Davis, included a factorial analysis of rating forms, 
and concluded that student rating forms are usually 
comprised of items centering around five major factors, 
noted earlier: Analytic/Synthetic, Organization/Clarity, 
Instructor-Group Interaction, Instructor-Individual Stu- 
dent Interaction, and Dynamism/Enthusiasm. These cate- 
gories can be used as an instrument for student rating of 
teachers by any college or university; or categories can be 
made up from a variety of sources to reflect the response 
modes that meet a particular institution’s requirements. 

Equally important is the development of procedures 
and guidelines for the use of the instrument and the pur- 
poses of sudi a measure. This step would allow student 
involvement at the level of an indirect measure of teaching 
. effectiven^s and would provide a description of teachers’ 
traits and skdls as well as a decription of the attitudes and 
feelings of the students about the courses. , 

3* The Combination of Student Ratings and the 
Review of Student Products. Tliis third option incljjdes 
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the design and construction eft a student rating instru- 
ment, as well as the promotion ot the student perform- 
ance concept and explicit statements as to the proposed 
use of the findings. Tltis option would be considered the 
first phase of a long-range effort to implement Option 1: 



K 

o 

ERIC 



• Design of a student rating form An Instrument 
should be designed to enable students^d faculty to 
record student perceptions about^^heir courses and 
teacher performance. This instrument would be con- 
structed while an institution considered the more 
iong=range implications of Option 1 . 

• Communication of the concept of direct measure^ 
men^t of teacher effectiveness - The concept of 
teacher effectiveness, defined as the study of the 
outcomes of teaching, would be fully examined and 
communicated in its intent and meaning to the aca- 
demic divisions of an institution. Since a long-range 
study of teaching effectiveness wouTd require depth 
analysis of curriculum by the departmental and pro- 
gram areas, faculty groups would be provided with a 
set of guide lines and a list of instructional resources 
to aidihem in defining programs in terms of student 
learning behaviors. This option would include initial 
efforts by the faculty of each department to 
identify its desired student outcomes in reference to 
the division’s goals. In addition, each department 
would determine its own methods of appropriating 
a d examining student efforts, and these methods, 
in turn, would be communicated to the division 
chairman. A study of the range of approaches 
formulated by each division will aid the develop- 
ment of institutional guidelines for a direct, 
outcome-oriented approach to teacher effectivene^. 

• Use of findinp - In the best sense, student ratings 
of faculty and faculty measurement of the out- 
comes of teaching must provide the conditions for 
teaching improvement. Hence, this third option 
would include a policymaking body — consisting of 
faculty and students -- to determine how to use 
these findinp. A necessary, but not sufficient condi- 
tion for establishihg the criteria for teacher evalua- 
tion is the continual commimication among the 
groups involved* ll' such communication is not main- 
tained, any of the evaluation procedures could 
become a divisive factor. The need for a^^ood start 
and the comple3dty of teacher evaluations suggest 
that Option No. 1 shoidd be the^primary goal, with 
Option No. 3 as the practical dioice for an institu- 
tion’s first-year effort* When implementing Option 
No.‘ 3, an institution should simulfeneously initiate 
efforts toward Option No. 1, The following recom- 
mendations to implement a Idni-range program 
similar to Option 1 are submitted with jthe under- 
standing that their implementation would require 
considerable faculty and student effort- *a prerequi- 
site to direcftly measure teacher effectivenjess* 



Recommendations fo? long-range direct measure^ 
ment of teaching effectiveness 

Five steps are recommended for the long-range study 
of teaching effectiveness. These steps are summarized as 
follows: 

• Each discipline or program area develop a bqmpre- 
hensive statement of its program. 

• The faculty in the various disciplines devise a clear 
statement about the range of pertormance expecta- 
tions held for students in the faculty’s component ^ 
of the program. 

• With the dear statement of objectives as a reference 
point, the faculty of a discipline establishes consen- 
sus* as to general and specific definitians of their 
pro^am. 

• Obtain baseline data via a premeasure when students 
enter a course. 

• Measure the growth of students in a given course by 
the objectives of the course. Use -this measure of 
student achievement as a direct measure of teacher 
effectiveness. 

The implementation of these steps would take the fol- 
lowing form: t 

1 , Comprehensive statement of progranL Each disci- 
pline or program area would develop a comprehensive 
statement of its program. This includes both a general 
affirmation of departinental and program goafs, as well as 
a more detahed definition of the contribution to be made 
by each component to th^. total pro^am* This step should 
j provide a clear understanding of the aims of the discipline, 
as well as the expected contribution of courses, field 
work, senunars, and Independent study arrangements. 
This information would be available for the analysis of 
‘ faculty, students, and administration, 

2* Clearly stated objectives for each course. The gen- 
eral definition of a discipline’s program is an important 
step, but is not complete enou^ to enable the evaluation 
of teaching in terms of teacher effectiveness to proceed. 
Consequently, the faculty in the various disciplines and 
pro^ams ^ould continue to wojk on more specific in ter- 
’ pretations of each course or program component for 
which faculty members are responsible. This step would 
•include the itenuzation of course content in a more exact 
forrt than that usually distributed to itudentsin course 
outlines. TWs step also Includes descriptions of what stu- 
dents wUl be able to do as they complete parts of or the 
complete prograhi component. In this way, faculty and 
students will have a common basis for understanding the 
inclusion of a particular course within a program. Both 
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students and taculiy will benetlt iroin u clear statement 
about the range of per|ormance expectations held for stu- 
dents as defined for that component (course, etc.) by the 
faculty. 

This procedure may seem overly complex; even 
’.vorse, it may seem an abr idee men i ot the instructor s 
freedom in tlie development of his teaching. From 
another perspective, however, the completion o^program 
definitinn. includitig specific Vourse elements and per- 
■formance expeciaiions for stud^its, may actually increase 
opportunities tor creative teacliing. AH instructors will 
not, not ibiu’iuid they be expected to teach in the same 
way. Given rather specific common^ agreement among 
faculty as to the purpose and composition of a course, 
individual faculty may devise widely divergent approaches 
to inslniction. Certain students, for example, will demon^ 
strate great competence in a course in the early stages of 
instruction. The recognition of gifted students as well as 
those who need help, when devising the basic elements of 
a course, will enable the teacher to make better decisions 
for the class both as a group and as individuals. Precise 
program definition and planning is not a deterrent to crea- 
tive teaching and is clearly not inimical to student learn- 
ing. The general definition of a departmental program as 
well as a more specific course or program coniponent 
structure is never a completed process. The stated pro= 
gram is a guide and should be viewed as a tentative defini- 
tion m things; that is, as experience is gained with a 
program, and as additional students and faculty aje added, 
it becomes necessary to constantly assay the meaning and 
relevance of the df pajtmental program. This reassessment 
should proceed in terms of overall goals (e.g., to prepare 
students for the medical professions, research in industry, 
or graduate work, etc,), as well as in terms of the compo^ 
nent courses that comprise the program. Program defini- 
tion is not immutable, but a base which is subject to 
change. It does, thou^, become the stated intention of 
the departmental faculty, and it is gainst these stated 
intentions that faculty effectiveness be reasonably 

studied. 

3. Faculty consensus about program definition. The 
faculty of a discipline must arrive at a consensus ab-ut the 
general and specific definitions of their program. Most 
disciplines or program areas have curricular comnuttees 
that serve the purpose of stimulatiftg curricular change, 
revising course proposals, and many othrer related matters. 
Instructors should contribute to and be informed about 
the program in effect and should be made aware of the 
procedures to be followed if they decide to change the 
focus or structure, of a course. This kind of communica- 
tion is irriportant when judgments on teaching effective- 
ness are related to a teacher’s ability to . nurture certain 
kinds and levels of student performance. Again, the defini- 
tion of “program** should be flexible: in a college or 
university there should be ample opportunity for Innova- 
tive approaches. Wiere a basic and substantive change is 
to be acted upon, however, it is reasonable that such a 



change should be discussed iivthe hope that consensus can 
be reached about the implications ot the change vis-a-vis 
the department's lutul program. Consensus among the 
faculty regarding its progr^i provides the teacher with 
inforniiition about depart mclital expectations in his area 
of instruction. 

4. Obtaining baseline data. Since there will be o range 
of ability and background represented among students in 
different classes, an assessment should be made ot the 
range of student competence in a teacher’s classes. It 

is expected that in time this variable would even out, so 
that classes would reflect the general range ot ability 
present in the institution's student population. If the 
teacher’s effectiveness is viewed in regard to the imelluc- 
tual growth of students, then there must be a system 
designed to gain information about the level of compe- 
tence at which students begin a particular course. In a 
sens^? the mechanism designed to elicit this kind of 
information becouies a premeasure; and the nature of the 
premeasure, whether it be a pencil and paper 6xercise(s) 
or a demonstration of some type, is best deterrpined by an 
examination of the goals of a course and the nature of the 
discipline of which it is a part. The purposes of the pre- 
measure are: (1) to provide the teacher with information 
on the background of his students; (2) to help the teacher 
make instructional decisions that are appropriate to his 
students; and (3) to establish baseline data in the measure- 
ment of student progress in the course. If a group of 
teachers are teaching diffe4-ent sections of the same 
course, the same premeasuii. could be employed. The 
information collected on the premeasure would provide 
helpful data on the range of student abilities, as well as 
indicate the starting point from which a class could begin 
its study. The design of the premeasure should be deter- 
mined by the instructors who teach that part of the pro-, 
gram; a variety of opinions could be gathered to increase 
the likelihood that the premeasure would correspond to 
all major course aims, 

5. Measurement of student achievement. An assess- 
ment of the change in student competence should be 
obtained either at the end or at pivotal points during a 
given course. Using the baseline data obtained by premea- 
sures, the effectiveness of instruction can be measured In 
terms of the extent to which student performance has 
changed or in terms of the, extent to which students have 
learned what a teacher intended to teach. A procedure for 
measurement of such change has been detailed^ by Cook 
(1971) in Direct Measurement of Teaching Effectiveness: 
4 Chaiienge. Agreement about performance criteria using 
the objectives originally established as goals for the course 
and the assessment measure (fie., postmeasure) is critical 
when measuring teacher effectiveness. Group efforts simi- 
lar to those related to the premeasure are recommended. 
If 'coileaguBs teaching the same course agree upon the 
objectives of the course, their agreement on the assess- 
ment items would be si^ificant. Once the data describing 
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the iniellectual growth of students is available, the 
teacher's' effectiveness may be established— taking into 
consideration the variables pertaining to the data. 

Implementation ot these and oilier complementary 
steps requires a commitment on the part of the university 
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community-students, faculty, and adniinistialion- of sig- 
nificant dimensions; however, the result of such a commit- 
ment could be rewarding beyond the dreams of present 
teacher evaluation committees. 
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